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CHAPTER TWO 
DIVIDE AND CONQUER 


2.1. General Methodology 


Given a function to compute on n inputs the divide and conquer strategy 
suggests splitting the inputs into k distinct subsets,1<ksn, yielding k 
subproblems. These subproblems must be solved, and then a method must be 
found to combine sub-solutions into a solution of whole. If the subproblems 
are still relatively large, then the divide and conquer strategy can 
possibly reapplied. 


* Often the subproblems resulting from divide and conquer design are of 
the same type as the original problem. The principle is naturally expressed 
using recursive algorithm. that is smaller and smaller subproblems of the 
same kind are generated until eventually subproblems that are small enough 
to be solved without splitting are produced. 


Generally, the divide and conquer strategy solves a problem by: 


1. Breaking it into subproblems that are themselves smaller instance of 
the same type of problem. 

2. Reclusively solving these subproblems. 

3. Appropriately combining their answers. 


~ The real work is done piecemeal, in three different places: in the 
partitioning of problem into subproblems; at the very tail end of the 
recursion - when the subproblems are so small that they are solved outright; 
and in the gluing together of partial answers. These are held together and 
coordinated by the algorithm’s core recursive structure. 


Below algorithm DAndC is initially invoked as DandC(p), where p is the 
problem to be solved. 


Small(p) is a Boolean-valued function that determines whether the input 
size is small enough that the answer can be computed without splitting. If 
this is so, the function S is invoked. Otherwise, the problem p is divided 
into smaller subproblems. 


These subproblems P1, P2, .., Pk are solved by recursive applications of 
DandC. Combine is a function that determines the solutions to P using the 
solutions to the k subproblems. 
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Algorithm DandC(p) 


{ 
if small(p) then return s(p); 
else 
divide p into smaller instances pl, p2, p3, .., p(k), k>=1; 
apply DandC to each of these subproblems; 
return Combine(DandC(p1), DandC(p2), .., DandC(pk)); 
} 


Then, the computing time of DandC is described by recurrence relation 
=1 
T(n) = ee n 
T(n)+T(n)+f(n) n>1 


Where, T(n) is time for any input of size n. g(n) is the time to compute 
the answer directly for small problem inputs and f(n) is time for dividing 
and combining the solutions for subproblems. 


The complexity of may divide and conquer algorithms is given by recurrence 
relations of the form; 


T(1) n=1 
A ie {ar (*) + f(n) n>1 


a power of b. 


a and b are constants. T(1) is known and n is 


Method for solving any such recurrence relation is called substitution 
method. Substitution methods makes substitutions for each occurrence of the 
function T in the right-hand side until all such occurrence disappear. 


* Finding Minimum and Maximum 


The aim is finding the minimum and maximum item in a set of n elements. See 
examples below; 


Algorithm StraightMM(a, n, max, min) 
{ 
max:=min:=a[1]; 
for i:=2 to n do 
if ali] > max then 
max:=a[i]; 
if ali] < min then 
min:=a[i]; 
} 
e In this algorithm, comparisons dominate execution time over statements 
of operations. This happens when we compare large or complex values 
like vectors, polynomials and more. Considering this we will calculate 
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time complexity in terms of number of comparisons made to find max 
and min values. 

e The two if conditions will run 2(n-1) comparisons in worst, average 
and best cases. 


* We can improve this algorithm, by changing the last if, into else if 
condition because we know that any item greater than max is not a min 
value. 


Algorithm StraightMM(a, n, max, min) 


{ 
max:=min:=a[1]; 
for i:=2 to n do 
if ali] > max then 
max:=a[i]; 
else 
if a[i] < min then 
min:=a[i]; 
} 


Now, best case occurs when elements are in increasing order, that is n-1 
comparisons. Worst case occurs when elements are in decreasing order that 
needs to check both conditions. 2(n-1) comparisons. 


To apply divide and conquer; 


Let, P=(n, a[i],..., a[j]) denotes an arbitrary instances of the problem. 
Here n is the number of elements in the list a[i],..., a[j]. 


Now, Small(p) is true when n<=2. Means that, min=max=a[i] if n=1, and if 
n=2, can be solved by one comparison. 


If n>=2, P can be divided into subproblems. If we divide P into two 
instances; 


P1=(Ln/2J, a[1], ..., a[L n/2J]) and P2=(n- Ln/2J, a[Ln/2J+1],..., a[n]). 


e If max(P) and min(P) are largest and smallest elements in P 
respectively, then min(P) is smaller of min(P1) and min(P2), likewise 
max(P). 

e Invoke the this algorithm for the first time as MinMax(1, n, max, min) 
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Algorithm MinMax(k, j, max, min) 


{ 
if k==j then min:=max:=a[k]; //n=1 
else if (k=j-1) then {in=2 
if a[k] < a[j] then 
max:=a[j]; min:=a[k]; 
else 
max:=a[k]; min:=a[j]; 
else //n>2 
//diving stage 
mid = L(k+j)/2]; 
MinMax(k, mid, max, min); 
MinMax(mid+1, j, max1, min1); 
//combining phase 
if max1 > max then max:=max1; 
if mini < min then min:=min1; 
} 
E.g. 


A [1] [2] [3] [4] [5] [6] [7] [8] [9] 
4 6 10 1 -8 20 3 17 -10 
By finding mid value, we can construct a tree to see executions and to trace 
return order of the recursion. From the following recursion return order 
tree; when n=2, [k, j] and when n>2, we will find mid value, [k, mid, j]. 


For example, to find the first min and max, start from the left leaves of 
the tree. Compare elements at index of 1 and 2. That is A[1] and A[2]. So, 
for this min=4 and max=6. Now, the min and max are equal at leaf 3, that is 
min=max=A[3] = 10. At step 5, compare min and max of the two child nodes. 
min = 4 and max = 10. Now, it is your turn to complete and find min and max 
for the whole problem. 


9 [1,5,9] 
/ \ 
5 8 
[1,353] [6,7,9] 
/ \ / 5," 
3 4: 6 7 
[1,2,3] [4,5] [6,7] [8,9] 


/ \ ; 
[1,2] [3,3] 
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Let A be an array of size n=9; and numbers above the table cells are indexes. 
Now, A[4] = 1. Since n > 2, we divide the array into two halves. 


If T(n) represents this number, then the resulting recurrence relation is 
where n is a power of two, n = 2* for some positive integer k, then 


0 n=1 
T(n) = 41 n=2 
T (n/2) +T(n/2) + 2 n>2 


What is number of comparisons needed then? 


0 n=1 

T(n) = 41 n=2 
n 

2T(5)+2 n>2 


T(n) = 2T(n/2) + 2 


2[2T(n/4) + 2] + 2, dividing the n/2 into twos. 


4T(n/4) +4 + 2 


4[2T(n/8) + 2] +4 +2 


8T(n/8) +8+4+4+2 


QIOE Tf 26) DBE a DS he as de 


NB: n is factor of two. If we divide n for 2 subproblems we will get 1, 
since 2% equals n. Meaning that the subproblems contain single element, 


with zero comparison or T(1). If we stop dividing at 2-1, then the 
subproblems contains two elements, with one comparison or T(2). 


T(m) = 2k-1 T(m/2k-2) 4 2k-2 4 22 + 0, + 21, T(n/2k)) = T(2) = 1. 

T(n) = 262 102) 43 -genteeg a 27 

T(n) = 2k-1 4 2k - 2 

T(n) = n/2 +n - 2 => 3n/2-2 in all cases; best, average, and worst. 


This algorithm takes 25% less time than StraightMM, 2n-2. MinMax is 
optimal in time complexity, but it does not mean better in practice 
because, it worse in storage space required for storing k, j, min, max, 
mini, max1 for each recursive call. 
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* Merge Sort 

Merge sort is an example of divide and conquer algorithm. Its worst-case 
complexity is O(nlogn). To understand merge sort, assume that elements 
are arranged in the non-decreasing order. 

Assume a sequence of n elements a[1], a[2],...a[n]. Split them into two 
sets like a[1], a[2],..a[Ln/2J] and a[L(n/2)1]+1],...a[n]. Each set is 
individually sorted, and the resulting sorted sequences are merged to 


produce a single sorted sequence of n elements. 


Merge-sort is based on the divide-and-conquer paradigm. The Merge-sort 
algorithm can be described in general terms as consisting of the 
following three steps: 
1. Divide Step: If given array A has zero or one element, it is already 
sorted. Otherwise, divide A into two arrays, each containing about 
half of the elements of A. 
2. Recursion Step: Recursively sort array Al and A2. 
3. Conquer Step: Combine the elements back in A by merging the sorted 
arrays Al and A2 into a sorted sequence. 
If the time for the merging operation is proportional to n, then the 
computing time for merge sort is described by the recurrence relation; 


a n=1 
T(n) = or @+en | for some constant a and c. cn time taken 


to divide and combine the two halves (n/2). When n is a power of 2, n=2k 
we can solve this equation by successive substitutions: 


T(n) = 2T(n/2) + cn 
= 2[2T(n/4) + cn/2] + cn, substituting n/4, or dividing n/2 into twos. 
= 4T(n/4) + cn + cn, again let’s divide n/4 into twos. 
= 4[2T(n/8) + cn/4] + cn + cn 


= 8T(n/8) + 3cn, by keep on dividing until we can’t divide anymore, 
we will reach at tree level k. Were all subproblems are size of 1. 


= 2kT(n/2k) + ken. When n = 2k, n/2k = 1. And T(1) is a, from above. 
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2‘T(1) + kcn = na + ken => an + ken 
= an + cn (logn), k is height of the tree. 


= dropping minimal terms,and coefficients T(n) = O(nlogn). Read for 
more detail on this. 


Algorithm MergeSort(low, high) 


{ 
if low < high then 
mid = L(low + high)/2]; 
MergeSort(low, mid); 
MergeSort(mid+1, high); 
Merge(low, mid, high); 
} 


Merge algorithm takes two subparts and combine them. 


Algorithm Merge(low, mid, high) 


{ 
h:=low; i:=low; j:=mid+1; 
while h<=mid and j<=high do 
if a[h] <= a[j] then 
b[i]:=a[h]; h:=h+1; 
else 
b[i]:=a[j]; j:=j+1; 
i:=i+1; 
if h > mid then 
for k:=j to high do 
b[i]:=a[k]; i:=1i+1; 
else 
for k:=h to mid do 
b[i]:=a[k]; i:=1i+1; 
for k:=low to high do 
a[k]:=b[k]; 
} 


e Analysis of Merge Sort 


We can think of the three steps (divide, conquer and combine) separately, 
so that we can have a clear view of what is going on; The divide step is 
finding mid value, a simple step to be executed at constant time, O(1). At 
the combine step, merges a total of n element, O(n). So, we can say that 
dividing and combining takes O(1) + O(n) time, which is O(n). 
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To make things more concrete, let’s say that divide and combine steps 
together take cn time for some constant c. See from the following diagram; 
MergeSort on ann element subarray as being the sum of twice the running 
time of MergeSort on an n/2-element subarray (for conquer step) plus cn 
(for divide and combine steps). 


Now we have to figure out the running time of two recursive calls on n/2 
elements. Each of these two recursive calls takes twice of the running time 
of MergeSort on an n/4 element subarray. We have two subproblems of size 
n/2, and each takes cn/2 time to merge, and so the total time we spend 
merging for subproblems of size n/2 is 2*cn/2 = cn. 


Subproblems Total Merging time for all 
Size subproblems of this size 
n Cn | 
; * | 
- \ | 
n/2 n/2 2 (cn/2) | 
/ \ 7 * | = on 
n/4 n/4 n/4 n/4 4(cn/4) | 
« * ‘ ‘ ; | 
« * ‘ ‘ | 
RREBRDEDA 2 eae Ps n(cn/n) | 
| 
n 


What do you think would happened for the subprobLlems of n/8? Well, there 
will be eight of them (the subproblems), and the merging time for each 
will be cn/8. So, total merging time is 8*cn/8 = cn. 

As subprobLlems get smaller, number of subproblems doubles at each Level 
of the recursion, but the merge time halves. 

The total running time of MergeSort is the sum of merging times for 
levels. Read the highlighted text below; 

Whenever we divide a number into half in every step, it can be represented 
using a Logarithmic function, which is Logn and the number of steps 
(height of tree) can be represented by Logn + 1(at most). +1 happen when 


the Last non-terminal node has only one child. 
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The total time of merge sort is 1*cn. Where 1 = logn + 1 (height of the 


tree, or say maximum level the tree divides until it cannot be divided). 


T(n) = cn(logn + 1) = cnlogn + cn, after dropping term cn and coefficient 


c, running time is O(nlogn) in all scenarios. 


E.g., The tree structure of Merge and MergeSort are given as follows; 


[1,9] 
/ \ 
[1,5] [6,9] 
, X% /  \ 
[1,3] [4,5] [6,7] [8,9] 
; - fs / \ 
[1,2] [3] [4] [5] [6] [7] [8] [9] 
po 
[1] [2] 
Ss Tree of Calls of MergeSort(low, high) ----- 


Liyg tye] 
| 
[1,2,3] [4,4,5] [6,6,7] [8,8,9] 
\ / \ f 
\ / \ / 
[1,3,5] [6,7,9] 
\ / 
\ / 
‘ / 
" / 
[1,5,9] 
----- Tree of calls of Merge (low,mid, high) ---- 
| 


Now, you can any array of size n, to test the correctness of the step. But, 
unlike MergeSort, this merging tree works down, you start merging from index 
of [1,1,2]. 
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